The Impact of Using Combinatorial Optimisation for Static Caching of Posting Lists

Authors

  • Casper Petersen
  • Jakob Grue Simonsen
  • Christina Lioma
Abstract

Caching posting lists can reduce the amount of disk I/O required to evaluate a query. Current methods use optimisation procedures for maximising the cache hit ratio. A recent method selects posting lists for static caching in a greedy manner and obtains higher hit rates than standard cache eviction policies such as LRU and LFU. However, a greedy method does not formally guarantee an optimal solution. We investigate whether the use of methods guaranteed, in theory, to find an approximately optimal solution would yield higher hit rates. Thus, we cast the selection of posting lists for caching as an integer linear programming problem and perform a series of experiments using heuristics from combinatorial optimisation (CCO) to find optimal solutions. Using simulated query logs we find that CCO yields comparable results to a greedy baseline using cache sizes between 200 and 1000 MB, with modest improvements for queries of length two to three.
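The contrast between greedy and exact selection can be illustrated with a minimal sketch. It assumes a simplified objective that may differ from the paper's actual formulation: each posting list has a size and a query-log frequency, and the goal is to maximise the total frequency of cached lists under a size budget (a 0/1 knapsack, which is a small integer linear program). The data, function names, and the dynamic-programming solver are all hypothetical choices for illustration; the abstract does not say which solvers or heuristics were used.

```python
from typing import Dict, Set, Tuple

# Hypothetical toy data: term -> (posting list size in MB, query-log frequency).
POSTINGS: Dict[str, Tuple[int, int]] = {
    "a": (6, 60),
    "b": (5, 45),
    "c": (5, 45),
}


def greedy_select(postings: Dict[str, Tuple[int, int]], capacity: int) -> Set[str]:
    """Pick lists in decreasing frequency/size order while they still fit."""
    ranked = sorted(
        postings,
        key=lambda t: postings[t][1] / postings[t][0],
        reverse=True,
    )
    chosen: Set[str] = set()
    free = capacity
    for term in ranked:
        size, _ = postings[term]
        if size <= free:
            chosen.add(term)
            free -= size
    return chosen


def exact_select(postings: Dict[str, Tuple[int, int]], capacity: int) -> Set[str]:
    """Solve the same problem exactly with 0/1 knapsack dynamic programming.

    best[c] holds the best (total frequency, chosen terms) using at most c MB.
    Assumes integer sizes; this stands in for an ILP solver on toy inputs.
    """
    best = [(0, frozenset())] * (capacity + 1)
    for term, (size, freq) in postings.items():
        # Iterate capacities downwards so each list is used at most once.
        for c in range(capacity, size - 1, -1):
            candidate = best[c - size][0] + freq
            if candidate > best[c][0]:
                best[c] = (candidate, best[c - size][1] | {term})
    return set(best[capacity][1])


def total_frequency(postings: Dict[str, Tuple[int, int]], chosen: Set[str]) -> int:
    """Sum the query-log frequencies of the cached terms."""
    return sum(postings[t][1] for t in chosen)


if __name__ == "__main__":
    for name, select in (("greedy", greedy_select), ("exact", exact_select)):
        picked = select(POSTINGS, 10)
        print(name, sorted(picked), total_frequency(POSTINGS, picked))
```

With a 10 MB budget on this toy input, the greedy rule caches only the list with the best frequency-to-size ratio and leaves space unused, while the exact method fills the cache with the two smaller lists. On realistic query logs the gap may be far smaller, which is consistent with the comparable results the abstract reports.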

Similar resources

A five-level static cache architecture for web search engines

Caching is a crucial performance component of large-scale web search engines, as it greatly helps reduce average query response times and query processing workloads on backend search clusters. In this paper, we describe a multi-level static cache architecture that stores five different item types: query results, precomputed scores, posting lists, precomputed intersections of posting lists, an...

Location and Capacity of UPFC to Improve the Operation Power System Using a Grasshopper Optimisation Algorithm

The Unified Power Flow Controller (UPFC) is one of the most efficient FACTS devices; it can act, individually or in combination, on all the parameters that affect power transmission in transmission lines. With this flexibility in the parameters governing power flow, power system operation can be improved. In this study, the optimization algorithm based on location and optimal capacity of UPF...

Improve Replica Placement in Content Distribution Networks with Hybrid Technique

The increasing use of the Internet and its accelerated growth reduce available network bandwidth and server capacity; as a result, the quality of Internet services becomes unacceptable for users, even though efficient and effective delivery of web content plays an important role in improving performance. Content distribution networks were introduced to address this issue. Replicatin...

Distributed search based on self-indexed compressed text

Query response times within a fraction of a second in Web search engines are feasible due to the use of indexing and caching techniques, which are devised for large text collections partitioned and replicated into a set of distributed memory processors. This paper proposes an alternative query processing method for this setting, which is based on a combination of self-indexed compressed text an...

Optimisation of stacking sequence for composite plates under slamming impact loads using the genetic algorithm method

Optimisation of stacking sequence for composite plates under slamming impact loads using genetic algorithm method is studied in this paper. For this purpose, slamming load is assumed to have a uniform distribution with a triangular-pulse type of intensity function. In order to perform optimisation based on the genetic algorithm method, a special code is written in MATLAB software environment. T...

Publication date: 2015